3574 results found.
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:KeyGames: A Game Theoretic Approach to Automatic Keyphrase Extraction
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mudit Mangal | Inspec Database Keyword Extraction Data Set | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
170 texts / 23,346 coreference chains OtherProduction Status:
Newly created-finished
Use:
Discourse
-
Paper title:A Straightforward Approach to Narratologically Grounded Character Identification
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mark Finlayson | Character Annotations on Existing Corpora | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English French
Availability:
Not Available
License:
Size:
140M sentences Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Human or Neural Translation?
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Phillippe Langlais | Translation Memory | /N |
Documentation:
None
Speech/Written
Lexicon,
Language Type:
Multilingual
Languages:
Czech English Finnish French German Russian
Availability:
Freely Available
License:
Size:
206, 395 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Understanding Pure Character-Based Neural Machine Translation: The Case of Translating Finnish into English
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gongbo Tang | MuCow | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 (CC BY 4.0)
Size:
172073 entries Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Aspect-based Document Similarity for Research Papers
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Malte Ostendorff | Aspect-based Document Similarity for Research Papers | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Gnu
Size:
2587 words Production Status:
Newly created-finished
Use:
Lexicon Creation/Annotation
-
Paper title:NYTWIT: A Dataset of Novel Words in the New York Times
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yuval Pinter | New York Times Word Innovation Types v.1.1 | /N |
Documentation:
Within submission
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-SA 4.0 license
Size:
40 MByte Production Status:
Existing-used
Use:
Question Answering
-
Paper title:ForceReader: a BERT-based Interactive Machine Reading Comprehension Model with Attention Separation
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | zheng chen | Stanford Question Answering Dataset | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
N/A
Size:
1 MByte Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mohit Chandra | GAB Abuse Dataset | /N |
Documentation:
We will provide the documentation in English
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
N/A
Size:
5 GByte Production Status:
Existing-used
Use:
Text Mining
-
Paper title:AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mohit Chandra | pushshift.io/gab | /N |
Documentation:
N/A
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
74 MByte Production Status:
Existing-used
Use:
Natural Language Generation
-
Paper title:Automatic Distractor Generation for Multiple Choice Questions in Standard Tests
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Zhaopeng Qiu | Distractor-Generation-RACE | /N |
Documentation:
None




